MÖBIUS DISJOINTNESS FOR ANALYTIC SKEW PRODUCTS

ZHIREN WANG 


Abstract. We show that the Möbius function is disjoint from every analytic skew product dynamical system on $\mathbb{T}^2$ over a rotation of the circle.

1. Introduction

Let $h$ be a continuous map from the circle $\mathbb{T}^1 = \mathbb{R}/\mathbb{Z}$ to itself. For $\alpha \in [0,1)$ we consider the skew transform on $\mathbb{T}^2$ given by

$$T(x, y) = (x + \alpha,\ y + h(x)). \tag{1.1}$$

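We show the following Möbius disjointness statement:

Theorem 1.1. Suppose $h : \mathbb{T}^1 \to \mathbb{T}^1$ is analytic and $T$ is as above. Then

$$\frac{1}{N}\sum_{n \le N} \mu(n) f(T^n(x_0, y_0)) \to 0, \quad \forall (x_0, y_0) \in \mathbb{T}^2,\ \forall f \in C(\mathbb{T}^2). \tag{1.2}$$

We emphasize that there is no assumption on $\alpha$.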

The Möbius function is defined by $\mu(n) = (-1)^k$ if $n$ is the product of $k$ distinct primes and $\mu(n) = 0$ otherwise. An important theme in dynamical systems and number theory in recent years is the randomness of $\mu$, which is characterized by Sarnak's Möbius Disjointness Conjecture:

Conjecture 1.2. [Sar09] If $X$ is a compact metric space and $T : X \to X$ is a continuous map with zero topological entropy, then $\frac{1}{N}\sum_{n \le N} \mu(n) f(T^n(x)) \to 0$ for all continuous functions $f$ on $X$ and all $x \in X$.
An important feature of the conjecture is that disjointness is expected to hold for every $x$ instead of for almost every $x$ with respect to some $T$-invariant probability measure.

The case where $X$ is finite is equivalent to the prime number theorem in arithmetic progressions. And the case where $T$ is a rotation on the circle is Davenport's theorem [Dav37] that $\sum_{n \le N} \mu(n) e(\alpha n) = o(N)$ uniformly for all $\alpha$.
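As a numerical illustration of the objects in Davenport's theorem, the following minimal sketch computes the Möbius function by a sieve and evaluates the normalized sum $\frac{1}{N}\left|\sum_{n \le N}\mu(n)e(\alpha n)\right|$. The function names `mobius_sieve` and `davenport_average`, the cutoff $N$ and the sample value of $\alpha$ are assumptions made for this example only; a finite computation of course proves nothing about the $o(N)$ statement.

```python
import cmath
import math


def mobius_sieve(limit):
    """Return a list mu with mu[n] equal to the Moebius function of n for 1 <= n <= limit."""
    mu = [1] * (limit + 1)
    mu[0] = 0  # index 0 is unused
    is_prime = [True] * (limit + 1)
    for p in range(2, limit + 1):
        if is_prime[p]:
            # Each prime factor p flips the sign once.
            for m in range(p, limit + 1, p):
                if m > p:
                    is_prime[m] = False
                mu[m] = -mu[m]
            # Multiples of p^2 are not square-free.
            for m in range(p * p, limit + 1, p * p):
                mu[m] = 0
    return mu


def davenport_average(mu, alpha, N):
    """Return |sum_{n <= N} mu(n) e(alpha n)| / N, where e(t) = exp(2 pi i t)."""
    total = sum(mu[n] * cmath.exp(2j * math.pi * alpha * n) for n in range(1, N + 1))
    return abs(total) / N


if __name__ == "__main__":
    N = 20000  # illustrative cutoff (assumption)
    mu = mobius_sieve(N)
    print(davenport_average(mu, math.sqrt(2), N))
```

The sieve runs in roughly $O(N \log\log N)$ time, which is enough for small experiments of this kind.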

Many special cases of Sarnak's Conjecture have been established. To list a few: [MR10], [Gre12], [GT12], [Bou13a, Bou13b], [BSZ13], [KPL13], [EALdlR14, MMR14, Pec15]. The majority of these results rely on Vinogradov's bilinear method, or one of its newer variants: either Vaughan's identity [Vau77] or the Bourgain-Sarnak-Ziegler criterion [BSZ13].

It should be remarked that in all the above cases, the dynamical system is regular in the sense that for every point $x$, $\frac{1}{N}\sum_{n \le N} \delta_{T^n(x)}$ converges to some $T$-invariant probability measure on $X$ in the weak-* topology.

Skew products on $\mathbb{T}^2$ provide the simplest examples of irregular dynamics. In the case of skew products, for Diophantine rotation numbers $\alpha$, which have full Lebesgue measure, the dynamics is topologically conjugate to an affine transformation, hence is regular and Conjecture 1.2 holds. Kułaga-Przymus and Lemańczyk showed in [KPL13] that this indeed holds for a much larger topologically generic set of $\alpha$. However, Furstenberg constructed in [Fur61] a counterexample showing that for some $\alpha$ and some analytic function $h$, the dynamics of $T$ is not regular.


In [LS15], Liu and Sarnak established Conjecture 1.2 for a class of analytic skew products of the form (1.1). This was a remarkable achievement because their result covers a topological conjugate of Furstenberg's counterexample, and therefore it was the first time that Conjecture 1.2 was proved for an irregular dynamical system. The hypothesis in [LS15] was that the Fourier coefficients $\hat h$ decay with exponential bounds both from above and from below, i.e., $e^{-\tau_2|m|} \ll |\hat h(m)| \ll e^{-\tau_1|m|}$ for some $\tau_1, \tau_2 > 0$. The upper bound corresponds to analyticity, while the lower bound was imposed in order to get extra cancellation using van der Corput's method of exponential sums.

The main result in this paper, Theorem 1.1, proves Conjecture 1.2 for every analytic skew product on $\mathbb{T}^2$ without assuming a lower bound for $|\hat h(m)|$.

Our proof is a mixture of dynamical and number-theoretical arguments. It is based on a dynamical dichotomy due to Furstenberg [Fur61], and uses the Bourgain-Sarnak-Ziegler criterion from [BSZ13]. The main new ingredient is the recent estimate by Matomäki-Radziwiłł-Tao [MRT15] on averages of multiplicative functions in short intervals.

In [Fur61], it was shown that $T$ either is uniquely ergodic or preserves an invariant measure that is a finite extension of the Lebesgue measure in the $x$ coordinate. We apply this to a two-parameter family of maps of the form

$$(x, y) \mapsto \Big(x + \alpha,\ y + \sum_{l=0}^{p_1-1} h(x_0 + p_1x + l\alpha) - \sum_{l=0}^{p_2-1} h(x_0 + p_2x + l\alpha)\Big),$$

where $(p_1, p_2)$ are pairs of distinct primes.

If unique ergodicity holds for all pairs of primes, it will follow that there is no correlation between the sequences $f(T^{p_1n}(x_0, y_0))$ and $f(T^{p_2n}(x_0, y_0))$, and the Bourgain-Sarnak-Ziegler criterion allows us to conclude (1.2).

In the case where unique ergodicity fails for at least one pair $(p_1, p_2)$, using Fourier series, we are able to show, after several steps of dynamical reductions, that in order for (1.2) to fail, the sequence $n \mapsto f(T^n(x_0, y_0))$ must be well approximated by a periodic sequence of period $Q$ in every interval of length $l$. The analyticity assumption allows us to make $l$ exponentially large in terms of $Q$, while $Q$ can be arbitrarily large itself. Then we decompose, in each interval of length $l$, the periodic sequence into a linear combination of Dirichlet characters $\chi$ of modulus $Q$. This reduces the problem to controlling the average of the multiplicative function $\mu(n)\chi(n)$ on a typical interval of length $l$.

Matomäki, Radziwiłł and Tao proved in [MRT15] that for a non-pretentious multiplicative function $\nu$ and $X \ge l \gg 1$, on most intervals $I \subset [X, 2X)$ of length $l$ the average of $\nu$ is very small. The function $\mu\chi$ is known to be non-pretentious, so this theorem applies in our setting. Moreover, the condition that $l$ is exponentially large compared to $Q$ ensures that the bound does not explode when different $\mu\chi$'s are combined, and gives the desired disjointness.

In fact, our method easily extends to all h satisfying \h{m) \ <C . 

But we will focus on the analytic case in this paper. 


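Notations. We will write $m_{\mathbb{T}^d}$ for the Lebesgue probability measure on $\mathbb{T}^d$, and $e(\theta)$ for the function $e^{2\pi i\theta}$. The norm $\|\theta\|$ of $\theta \in \mathbb{T}^1$ is the distance from $\theta$ to $0$. This norm extends naturally to $\theta \in \mathbb{R}$, that is, $\|\theta\| = \mathrm{dist}(\theta, \mathbb{Z})$. A basic fact that will be used repeatedly is $\|\theta\| \ll |e(\theta) - 1| \ll \|\theta\|$.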
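The two-sided comparison between $\|\theta\|$ and $|e(\theta) - 1|$ can be checked numerically. The sketch below is only an illustration; the helper names `e` and `circle_norm` are chosen for this example. It uses the explicit constants $4\|\theta\| \le |e(\theta) - 1| \le 2\pi\|\theta\|$, which follow from $|e(\theta) - 1| = 2|\sin(\pi\theta)|$.

```python
import cmath
import math


def e(theta):
    """The character e(theta) = exp(2 pi i theta)."""
    return cmath.exp(2j * math.pi * theta)


def circle_norm(theta):
    """Distance from theta to the nearest integer, i.e. ||theta||."""
    frac = theta % 1.0
    return min(frac, 1.0 - frac)


if __name__ == "__main__":
    for theta in (0.01, 0.25, 0.5, 2.75, -0.3):
        norm = circle_norm(theta)
        gap = abs(e(theta) - 1)
        # Expect 4 * norm <= gap <= 2 * pi * norm, up to rounding error.
        print(theta, 4 * norm, gap, 2 * math.pi * norm)
```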


Acknowledgments. I am deeply grateful to Peter Sarnak for encouragement and insightful discussions. Thanks are also due to Joanna Kułaga-Przymus and Mariusz Lemańczyk for valuable comments on an earlier draft.
This work was carried out during a research stay as a member at the Institute 
for Advanced Study, and I would like to thank the IAS for its support. This 
work was also supported by NSF grants DMS-1451247 and DMS-1501295. 

2. Existence of a coboundary 
Remark that $T^n(x, y) = (x + n\alpha,\ y + H(n, x))$ where

$$H(n, x) = \sum_{l=0}^{n-1} h(x + l\alpha). \tag{2.1}$$

One useful property is:

$$H(n_1n_2, x) = \sum_{l=0}^{n_1-1} H(n_2, x + ln_2\alpha). \tag{2.2}$$
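The formula for $T^n$ and the identity (2.2) are easy to test numerically. The following sketch does so for one sample choice of $h$ and $\alpha$; both choices, as well as the helper names `iterate` and `circle_dist`, are assumptions made only for this illustration.

```python
import math

ALPHA = math.sqrt(2) - 1  # sample irrational rotation number (assumption)


def h(x):
    """A sample analytic, 1-periodic function (hypothetical choice)."""
    return 0.3 * math.cos(2 * math.pi * x) + 0.1 * math.sin(4 * math.pi * x)


def H(n, x):
    """The cocycle H(n, x) = sum_{l=0}^{n-1} h(x + l * alpha), as in (2.1)."""
    return sum(h(x + l * ALPHA) for l in range(n))


def T(point):
    """One step of the skew product T(x, y) = (x + alpha, y + h(x)) on the torus."""
    x, y = point
    return ((x + ALPHA) % 1.0, (y + h(x)) % 1.0)


def iterate(point, n):
    """Apply T to the point n times."""
    for _ in range(n):
        point = T(point)
    return point


def circle_dist(a, b):
    """Distance between a and b on the circle R/Z."""
    d = (a - b) % 1.0
    return min(d, 1.0 - d)


if __name__ == "__main__":
    x, y, n = 0.123, 0.456, 12
    px, py = iterate((x, y), n)
    # Both distances should vanish up to rounding error.
    print(circle_dist(px, x + n * ALPHA), circle_dist(py, y + H(n, x)))
    # Identity (2.2) with n_1 = 2 and n_2 = 3.
    print(H(6, x) - sum(H(3, x + l * 3 * ALPHA) for l in range(2)))
```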

Remark 2.1. By density of trigonometric polynomials in $C(\mathbb{T}^2)$, it suffices to check (1.2) for every $f(x, y) = e(\xi_1x + \xi_2y)$, $\xi_1, \xi_2 \in \mathbb{Z}$, in order to establish (1.2).

In this section, we make the following reduction.

Lemma 2.2. For a continuous map $h : \mathbb{T}^1 \to \mathbb{T}^1$, if (1.2) fails, then $\alpha$ is irrational, and there exist:

• $s \in \mathbb{N}$;

• a pair of distinct primes $p_1$, $p_2$;

• a measurable map $g : \mathbb{T}^1 \to \mathbb{T}^1$,

such that

$$s\big(H(p_1, x_0 + p_1x) - H(p_2, x_0 + p_2x)\big) = g(x + \alpha) - g(x), \quad \text{for } m_{\mathbb{T}^1}\text{-a.e. } x. \tag{2.3}$$

Here the map $H$ is defined by (2.1).

Two tools are needed to prove this observation. The first is the Bourgain-Sarnak-Ziegler criterion for Möbius disjointness, which is an extension of Vinogradov's bilinear method and Vaughan's identity, and eliminates the need for a saving of a power of $\log N$ in the hypothesis.


Remark 2.3. As noted in [BSZ13], the Bourgain-Sarnak-Ziegler criterion is a dynamical interpretation of the bilinear method, which turns the study of type II sums into that of joinings between $T^{p_1}$ and $T^{p_2}$, where $p_1$, $p_2$ are different primes. In the case of skew products (1.1), Kułaga-Przymus and Lemańczyk studied such joinings in [KPL13] as dynamical systems on $\mathbb{T}^4$. A simple observation is that to study the correlation in Proposition 2.4 below, it suffices to study a certain $\mathbb{T}^2$-factor of the dynamics on $\mathbb{T}^4$, which would deduce Lemma 2.2 from [KPL13, Theorem 2.5.2]. We include the proof of Lemma 2.2 for completeness.


Proposition 2.4. [BSZ13] Suppose two functions $F, \nu : \mathbb{N} \to \mathbb{C}$ satisfy $|F| \le 1$, $|\nu| \le 1$ and $\nu$ is multiplicative. If for any pair of distinct primes $p_1$, $p_2$,

$$\sum_{n \le N} F(p_1n)\overline{F(p_2n)} = o(N),$$

then

$$\sum_{n \le N} \nu(n)F(n) = o(N).$$


The second tool we need is the following dynamical dichotomy proved by 
Furstenberg. 


Proposition 2.5. [Fur61] Suppose $(\Omega_0, T_0, \mu_0)$ is a uniquely ergodic topological dynamical system with $\mu_0$ being the unique ergodic measure, and $h : \Omega_0 \to \mathbb{T}^1$ is a continuous function. Let $T : \Omega_0 \times \mathbb{T}^1 \to \Omega_0 \times \mathbb{T}^1$ be defined by $T(x, y) = (T_0(x), y + h(x))$. Then exactly one of the following is true:

(1) $T$ is ergodic with $\mu_0 \times m_{\mathbb{T}^1}$ being the unique ergodic measure.

(2) There exist a measurable map $g : \Omega_0 \to \mathbb{T}^1$ and a non-zero integer $s$ such that $sh(x) = g(T_0(x)) - g(x)$ for $\mu_0$-almost every $x$.

Notice that in any case, $\mu_0 \times m_{\mathbb{T}^1}$ is $T$-invariant. And in case (2), for any $y_0 \in \mathbb{T}^1$, the set $\{(x, y) : s(y - y_0) = g(x)\}$ is $T$-invariant. As this set is a measurable $|s|$-fold cover of $\Omega_0$, it naturally supports a $T$-invariant probability measure that projects to $\mu_0$ and assigns equal mass to $|s|$ distinct points in each fiber along the $y$-direction.


Proof of Lemma 2.2. Suppose that (1.2) fails for some $f$. Then $\alpha$ must be irrational (see e.g. [LS15]). By Remark 2.1 we can assume $f(x, y) = e(\xi_1x + \xi_2y)$. Furthermore, $\xi_2$ cannot be $0$, as otherwise

$$\sum_{n \le N} \mu(n)f(T^n(x_0, y_0)) = \sum_{n \le N} \mu(n)e(\xi_1(x_0 + n\alpha)) = e(\xi_1x_0)\sum_{n \le N} \mu(n)e(n\xi_1\alpha),$$

which is uniformly $o(N)$ for all $\alpha$ by Davenport's estimate [Dav37].

By Proposition 2.4 there are distinct primes $p_1$, $p_2$ such that

$$\frac{1}{N}\sum_{n \le N} f(T^{p_1n}(x_0, y_0))\overline{f(T^{p_2n}(x_0, y_0))} \not\to 0. \tag{2.4}$$

By (2.1), the left hand side is equal to:

$$\begin{aligned}
&\frac{1}{N}\sum_{n \le N} e\big(\xi_1(x_0 + p_1n\alpha) + \xi_2(y_0 + H(p_1n, x_0))\big)\, e\big(-\xi_1(x_0 + p_2n\alpha) - \xi_2(y_0 + H(p_2n, x_0))\big) \\
={}& \frac{1}{N}\sum_{n \le N} e\big(\xi_1(p_1 - p_2)n\alpha + \xi_2(H(p_1n, x_0) - H(p_2n, x_0))\big) \\
={}& \frac{1}{N}\sum_{n \le N} f\big((p_1 - p_2)n\alpha,\ H(p_1n, x_0) - H(p_2n, x_0)\big).
\end{aligned} \tag{2.5}$$

Define a new dynamical system of the form (1.1) by

$$\tilde T(x, y) = \big(x + \alpha,\ y + H(p_1, x_0 + p_1x) - H(p_2, x_0 + p_2x)\big).$$

We claim that

$$\tilde T^n(0, 0) = \big(n\alpha,\ H(p_1n, x_0) - H(p_2n, x_0)\big) \tag{2.6}$$

for all $n \ge 0$. In fact, this is true for $n = 0$. Assume for induction that (2.6) holds for $n$; then $\tilde T^{n+1}(0, 0)$ equals

$$\big(n\alpha + \alpha,\ H(p_1n, x_0) - H(p_2n, x_0) + H(p_1, x_0 + p_1n\alpha) - H(p_2, x_0 + p_2n\alpha)\big).$$

Because by (2.2), for all $p \in \mathbb{N}$,

$$H(pn, x_0) + H(p, x_0 + pn\alpha) = \sum_{l=0}^{n-1} H(p, x_0 + lp\alpha) + H(p, x_0 + pn\alpha) = \sum_{l=0}^{n} H(p, x_0 + lp\alpha) = H(p(n+1), x_0),$$

(2.6) holds for $n + 1$ as well.

It follows that, for $\tilde f(x, y) = f((p_1 - p_2)x, y)$, the expression (2.5) equals

$$\frac{1}{N}\sum_{n \le N} \tilde f(\tilde T^n(0, 0)). \tag{2.7}$$
This implies that $m_{\mathbb{T}^2}$ cannot be the unique ergodic probability measure for $\tilde T$. Suppose otherwise; then any weak* limit of a subsequence of $\big\{\frac{1}{N}\sum_{n \le N} \delta_{\tilde T^n(0,0)}\big\}_{N=1}^{\infty}$, which is a $\tilde T$-invariant probability measure, must equal $m_{\mathbb{T}^2}$. Because the space of probability measures on $\mathbb{T}^2$ is weak* compact, $\frac{1}{N}\sum_{n \le N} \delta_{\tilde T^n(0,0)} \to m_{\mathbb{T}^2}$. By Birkhoff's ergodic theorem, (2.7) converges to

$$\iint \tilde f(x, y)\,dx\,dy = \iint f((p_1 - p_2)x, y)\,dx\,dy = \iint f(x, y)\,dx\,dy = \int e(\xi_1x)\,dx \cdot \int e(\xi_2y)\,dy,$$

which equals $0$ because $\xi_2 \ne 0$. This contradicts (2.4).

Since $\alpha$ is irrational, $x \mapsto x + \alpha$ is uniquely ergodic on $\mathbb{T}^1$. By applying Proposition 2.5 to the map $\tilde T$, we know that there are $s \in \mathbb{N}$ and a measurable map $g : \mathbb{T}^1 \to \mathbb{T}^1$ for which (2.3) holds for $m_{\mathbb{T}^1}$-almost every $x$. □


Recall that the degree of a continuous map $h : \mathbb{T}^1 \to \mathbb{T}^1$ is the image $d = h_*(1) \in \pi_1(\mathbb{T}^1) = \mathbb{Z}$, where $1$ is the identity in $\pi_1(\mathbb{T}^1)$.

Corollary 2.6. If a Lipschitz continuous map $h : \mathbb{T}^1 \to \mathbb{T}^1$ makes (1.2) fail, then $h$ is homotopically trivial.

The corollary follows from [KPL13, Remark 2.5.7]; we again give the proof for completeness.


Proof. Recall first a few simple facts:

(1) If $h$ has degree $d$, then so does $x \mapsto h(x + a)$ for all $a \in \mathbb{R}$.

(2) If $h$ has degree $d$, then $x \mapsto h(ax)$ has degree $ad$ for all $a \in \mathbb{Z}$.

(3) If $h_1$ and $h_2$ respectively have degrees $d_1$, $d_2$, then the degree of $h_1 + h_2$ is $d_1 + d_2$.

It follows that if $h$ has degree $d$, then the map

$$H(p_1, x_0 + p_1x) - H(p_2, x_0 + p_2x) = \sum_{l=0}^{p_1-1} h(x_0 + p_1x + l\alpha) - \sum_{l=0}^{p_2-1} h(x_0 + p_2x + l\alpha)$$

has degree $(p_1^2 - p_2^2)d$.

Assume, in order to get a contradiction, that $d \ne 0$. Then for any two different primes $p_1$, $p_2$, the map $H(p_1, x_0 + p_1x) - H(p_2, x_0 + p_2x)$ is homotopically non-trivial. Since $h$ is assumed to be Lipschitz, by [Fur61, Lemma 2.2], there is no solution to (2.3). So, by Lemma 2.2, (1.2) must hold. This contradicts the hypothesis and concludes the proof. □

Remark 2.7. If $h$ is as in Corollary 2.6, then it can be realized as a Lipschitz function from $\mathbb{R}/\mathbb{Z}$ to $\mathbb{R}$. In particular, one can talk about its Fourier series, which converges pointwise to $h$ because of Lipschitz continuity, i.e., for all $x \in \mathbb{T}^1$,

$$h(x) = \sum_{m \in \mathbb{Z}} \hat h(m)e(mx). \tag{2.8}$$

3. Eliminating non-resonant frequencies 


From now on, we suppose that (1.2) is not true. In addition, assume that $h$ is analytic, and hence (2.8) holds and there exists $\tau > 0$ for which

$$|\hat h(m)| \ll e^{-\tau|m|}, \tag{3.1}$$

where the implied constant may depend on $h$ and $\tau$.

Recall that $\alpha$ must be irrational in this case. Let $\frac{p_k}{q_k}$ be the convergents given by $\alpha$'s continued fraction expansion $[0; a_1, a_2, \cdots]$.

Remark 3.1. We list below a few fundamental properties of approximation of $\alpha$ by $\frac{p_k}{q_k}$ (see [Khi97]):

(1) $p_{k+1} = a_kp_k + p_{k-1}$, $q_{k+1} = a_kq_k + q_{k-1}$ and $(p_k, q_k) = 1$;

(2) $\frac{1}{q_{k+1} + q_k} < \|q_k\alpha\| < \frac{1}{q_{k+1}}$;

(3) If for integers $m$ and $n$, $\big|\alpha - \frac{n}{m}\big| < \frac{1}{2m^2}$, then $\frac{n}{m} = \frac{p_j}{q_j}$ for some $j$.
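The properties of convergents listed in Remark 3.1 can be checked on a concrete example. The sketch below is only an illustration: it takes $\alpha = \sqrt{2} - 1 = [0; 2, 2, 2, \cdots]$ as an assumed sample value, builds the convergents with the standard recurrence (indexed as $q_k = a_kq_{k-1} + q_{k-2}$, which may differ from the remark's indexing by a shift), and tests coprimality and the two-sided bound on $\|q_k\alpha\| = |q_k\alpha - p_k|$. The helper name `convergents` is hypothetical.

```python
import math


def convergents(partial_quotients):
    """Return the convergents (p_k, q_k) of [a_0; a_1, a_2, ...] via the standard recurrence."""
    p_prev, q_prev = 1, 0               # (p_{-1}, q_{-1})
    p, q = partial_quotients[0], 1      # (p_0, q_0)
    result = [(p, q)]
    for a in partial_quotients[1:]:
        p, p_prev = a * p + p_prev, p
        q, q_prev = a * q + q_prev, q
        result.append((p, q))
    return result


if __name__ == "__main__":
    alpha = math.sqrt(2) - 1            # sample value with expansion [0; 2, 2, 2, ...]
    convs = convergents([0] + [2] * 10)
    for (p, q), (_, q_next) in zip(convs, convs[1:]):
        approx_error = abs(q * alpha - p)
        # Expect gcd 1 and 1/(q_next + q) < |q*alpha - p| < 1/q_next.
        print(p, q, math.gcd(p, q), 1 / (q_next + q), approx_error, 1 / q_next)
```

Only the first ten or so convergents are used, so that floating-point error in $q_k\alpha - p_k$ stays far below the gaps being compared.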


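For all $b_1, b_2 \in \mathbb{N}$, define

$$M_{b_1,b_2} = \bigcup_{\substack{k:\ q_{k+1} > e^{\frac{\tau}{2}q_k} \\ q_k \ge b_1}} \{m : q_k \le |m| < b_2q_{k+1} \text{ and } q_k \mid m\}. \tag{3.2}$$

The following lemma is similar to [LS15, Lemma 4.1].

Lemma 3.2. Suppose $h : \mathbb{T}^1 \to \mathbb{R}$ satisfies (3.1). Then for all $b_1, b_2 \in \mathbb{N}$, the series

$$\sum_{m \notin M_{b_1,b_2} \cup \{0\}} \hat h(m)\frac{1}{e(m\alpha) - 1}e(mx)$$

converges uniformly and hence defines a continuous function $\phi(x)$.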
Proof. Since $M_{b_1,1} \subset M_{b_1,b_2}$, it suffices to prove the claim for $b_2 = 1$. If $m \notin M_{b_1,1} \cup \{0\}$, then at least one of the following three cases holds:

(1) $|m| < \min_{q_k \ge b_1} q_k$. Since only finitely many frequencies are involved in this case, they can be ignored without affecting the convergence.

(2) $q_k \le |m| < q_{k+1}$, $q_k \nmid m$. In this case, we claim that $\|m\alpha\| > \frac{1}{2|m|}$.

By Remark 3.1, $\|m\alpha\| > \frac{1}{2|m|}$ unless $|m| = lq_j$ and $\|m\alpha\| = |m\alpha - lp_j|$ for some $j \le k$ and $l \in \mathbb{Z}\setminus\{0\}$. In the latter case, by the assumption on $m$, $j$ must be less than $k$, and we still have

$$\|m\alpha\| = |l| \cdot |q_j\alpha - p_j| = |l| \cdot \|q_j\alpha\| > \frac{1}{q_{j+1} + q_j} \ge \frac{1}{q_k + q_{k-1}} > \frac{1}{2q_k} \ge \frac{1}{2|m|}.$$

It follows that

$$\Bigg|\sum_{\substack{q_k \le |m| < q_{k+1} \\ q_k \nmid m}} \hat h(m)\frac{1}{e(m\alpha) - 1}e(mx)\Bigg| \ll \sum_{\substack{q_k \le |m| < q_{k+1} \\ q_k \nmid m}} e^{-\tau|m|}\,|m| \ll \sum_{m=q_k}^{\infty} me^{-\tau m} \ll e^{-\frac{\tau}{2}q_k}. \tag{3.3}$$

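(3) $q_k \le |m| < q_{k+1} \le e^{\frac{\tau}{2}q_k}$ and $q_k \mid m$. In this case, $|m| = aq_k$ with $1 \le a \le a_k$. By Remark 3.1, $\frac{a}{q_{k+1} + q_k} < a\|q_k\alpha\| < \frac{a}{q_{k+1}}$, and thus $a\|q_k\alpha\| = \|m\alpha\|$. Therefore, because $q_{k+1} \le e^{\frac{\tau}{2}q_k}$,

$$\Bigg|\sum_{\substack{q_k \le |m| < q_{k+1} \\ q_k \mid m}} \hat h(m)\frac{1}{e(m\alpha) - 1}e(mx)\Bigg| \ll \sum_{a=1}^{\infty} \Big(e^{-\tau aq_k} \cdot \frac{q_{k+1} + q_k}{a}\Big) \ll e^{-\frac{\tau}{2}q_k}. \tag{3.4}$$

Since both (3.3) and (3.4) are convergent when summed over all the $q_k$'s, the lemma follows. □

In the next reduction, we eliminate the non-resonant frequencies, i.e., those not in $M_{b_1,b_2}$.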


Corollary 3.3. Suppose $h$ is as in Lemma 3.2. For arbitrary $b_1, b_2 \in \mathbb{N}$, let $h_1(x) = \sum_{m \in M_{b_1,b_2} \cup \{0\}} \hat h(m)e(mx)$. Then:

(1) $T$ satisfies (1.2) if and only if $T_1(x, y) = (x + \alpha, y + h_1(x))$ does;

(2) For distinct primes $p_1$, $p_2$, $h$ satisfies (2.3) for a measurable map $g : \mathbb{T}^1 \to \mathbb{T}^1$ if and only if $h_1$ satisfies the same condition with respect to another measurable map $g_1$.
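Proof. Write $h(x) = h_1(x) + h_2(x)$, where $h_1$ and $h_2$ are partial sums from the Fourier expansion of $h$, respectively over $M_{b_1,b_2} \cup \{0\}$ and its complement.

Let $\phi(x)$ be given by Lemma 3.2. Then

$$\begin{aligned}
\phi(x + \alpha) - \phi(x) &= \sum_{m \notin M_{b_1,b_2} \cup \{0\}} \hat h(m)\frac{1}{e(m\alpha) - 1}\big(e(mx + m\alpha) - e(mx)\big) \\
&= \sum_{m \notin M_{b_1,b_2} \cup \{0\}} \hat h(m)\frac{1}{e(m\alpha) - 1}\big(e(m\alpha) - 1\big)e(mx) \\
&= h_2(x).
\end{aligned}$$

Denote $\Phi(x, y) = (x, y - \phi(x))$. Then $\Phi^{-1}(x, y) = (x, y + \phi(x))$, and

$$\begin{aligned}
(\Phi^{-1} \circ T_1 \circ \Phi)(x, y) &= (\Phi^{-1} \circ T_1)(x, y - \phi(x)) \\
&= \Phi^{-1}(x + \alpha,\ y - \phi(x) + h_1(x)) \\
&= (x + \alpha,\ y - \phi(x) + h_1(x) + \phi(x + \alpha)) \\
&= (x + \alpha,\ y + h_1(x) + h_2(x)) = (x + \alpha,\ y + h(x)) \\
&= T(x, y).
\end{aligned}$$

Therefore, $f(T^n(x_0, y_0)) = f_1(T_1^n(x_1, y_1))$, where $f_1 = f \circ \Phi^{-1}$ and $(x_1, y_1) = \Phi(x_0, y_0)$. This shows claim (1).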
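The conjugation identity $\Phi^{-1} \circ T_1 \circ \Phi = T$ used for claim (1) only relies on $h - h_1$ being the coboundary $\phi(x + \alpha) - \phi(x)$. The sketch below tests this numerically. All concrete choices here are assumptions made for illustration: the rotation number, the transfer function `phi`, and the function `h1` are hypothetical samples, not the functions produced by Lemma 3.2.

```python
import math

ALPHA = math.sqrt(2) - 1  # sample irrational rotation number (assumption)


def phi(x):
    """Hypothetical continuous transfer function."""
    return 0.2 * math.sin(2 * math.pi * x)


def h1(x):
    """Hypothetical part of h that is kept."""
    return 0.3 * math.cos(6 * math.pi * x)


def h(x):
    """h = h1 + h2, where h2(x) = phi(x + alpha) - phi(x) is a coboundary."""
    return h1(x) + phi(x + ALPHA) - phi(x)


def T(x, y):
    return ((x + ALPHA) % 1.0, (y + h(x)) % 1.0)


def T1(x, y):
    return ((x + ALPHA) % 1.0, (y + h1(x)) % 1.0)


def Phi(x, y):
    return (x, (y - phi(x)) % 1.0)


def Phi_inv(x, y):
    return (x, (y + phi(x)) % 1.0)


def circle_dist(a, b):
    """Distance between a and b on the circle R/Z."""
    d = (a - b) % 1.0
    return min(d, 1.0 - d)


if __name__ == "__main__":
    for x, y in [(0.1, 0.2), (0.77, 0.05), (0.5, 0.93)]:
        lhs = Phi_inv(*T1(*Phi(x, y)))
        rhs = T(x, y)
        # Both coordinates should agree up to rounding error.
        print(circle_dist(lhs[0], rhs[0]), circle_dist(lhs[1], rhs[1]))
```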

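As for claim (2), define $H_1(p, x)$, $H_2(p, x)$ as in (2.1) respectively using $h_1$ and $h_2$. Then for any $p$, $H(p, x_0 + px) = H_1(p, x_0 + px) + H_2(p, x_0 + px)$. However, $H_2(p, x) = \sum_{l=0}^{p-1}\big(\phi(x + (l+1)\alpha) - \phi(x + l\alpha)\big) = \phi(x + p\alpha) - \phi(x)$.

Therefore,

$$\begin{aligned}
& s\big(H(p_1, x_0 + p_1x) - H(p_2, x_0 + p_2x)\big) - s\big(H_1(p_1, x_0 + p_1x) - H_1(p_2, x_0 + p_2x)\big) \\
={}& s\big(H_2(p_1, x_0 + p_1x) - H_2(p_2, x_0 + p_2x)\big) \\
={}& s\big(\phi(x_0 + p_1x + p_1\alpha) - \phi(x_0 + p_1x)\big) - s\big(\phi(x_0 + p_2x + p_2\alpha) - \phi(x_0 + p_2x)\big) \\
={}& g_2(x + \alpha) - g_2(x),
\end{aligned}$$

where $g_2(x) = s\big(\phi(x_0 + p_1x) - \phi(x_0 + p_2x)\big)$. So (2.3) holds for $H$ and $g$ if and only if it holds for $H_1$ and $g_1$ with $g_1 = g - g_2$. □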


4. Almost linearity of orbit along arithmetic progressions 


The aim of this section is to show that the orbit point $T^n(x_0, y_0) \in \mathbb{T}^2$ varies in an almost linear way when $n$ takes values in an arithmetic progression of step length $q_k$.

Lemma 4.1. Suppose an analytic function $h : \mathbb{T}^1 \to \mathbb{R}$ satisfies (3.1), and there are $b_1 > b_2 \ge 1$ such that $\hat h$ is supported on $M_{b_1,b_2} \cup \{0\}$. Then the estimate

$$|H(q_k, x) - q_k\hat h(0)| \ll e^{-\frac{\tau}{4}q_k}$$

holds uniformly for all $x \in \mathbb{T}^1$ and for all $k$ satisfying $q_{k+1} > e^{\frac{\tau}{2}q_k}$, where the implied constant depends on $h$, $\tau$ and $b_1$, $b_2$.
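Proof. Using geometric series, one easily finds that for all $n \in \mathbb{N}$,

$$H(n, x) = \sum_{l=0}^{n-1} h(x + l\alpha) = \sum_{l=0}^{n-1}\sum_{m \in \mathbb{Z}} \hat h(m)e(mx + lm\alpha) = n\hat h(0) + \sum_{m \ne 0} \hat h(m)\frac{e(nm\alpha) - 1}{e(m\alpha) - 1}e(mx). \tag{4.1}$$

Because $\hat h$ is supported on $M_{b_1,b_2} \cup \{0\}$,

$$\begin{aligned}
|H(q_k, x) - q_k\hat h(0)| &= \Bigg|\sum_{m \in M_{b_1,b_2}} \hat h(m)\frac{e(q_km\alpha) - 1}{e(m\alpha) - 1}e(mx)\Bigg| \\
&\le \sum_{\substack{j:\ q_{j+1} > e^{\frac{\tau}{2}q_j} \\ q_j \ge b_1}} \Bigg|\sum_{\substack{q_j \le |m| < b_2q_{j+1} \\ q_j \mid m}} \hat h(m)\frac{e(q_km\alpha) - 1}{e(m\alpha) - 1}e(mx)\Bigg| \\
&\le \sum_{\substack{j:\ q_{j+1} > e^{\frac{\tau}{2}q_j} \\ q_j \ge b_1}} \Bigg|\sum_{1 \le |a| \le b_2a_j} \hat h(aq_j)\frac{e(aq_jq_k\alpha) - 1}{e(aq_j\alpha) - 1}e(aq_jx)\Bigg| \\
&=: \sum_{\substack{j:\ q_{j+1} > e^{\frac{\tau}{2}q_j} \\ q_j \ge b_1}} |H_{j,k}(x)|.
\end{aligned} \tag{4.2}$$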

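We estimate $|H_{j,k}(x)|$ separately according to $j < k$ or $j \ge k$.

(1) $j < k$. By Remark 3.1, $a\|q_j\alpha\| < aq_{j+1}^{-1} \le b_2a_jq_{j+1}^{-1} < b_2q_j^{-1} \le b_2b_1^{-1} < 1$. This implies $\|aq_j\alpha\| = a\|q_j\alpha\| > a(q_{j+1} + q_j)^{-1}$.

On the other hand, $\|aq_jq_k\alpha\| \le aq_j\|q_k\alpha\| < aq_jq_{k+1}^{-1}$. By these estimates and the analyticity assumption (3.1),

$$|H_{j,k}(x)| \ll \sum_{a=1}^{b_2a_j} e^{-\tau aq_j}\,\frac{aq_jq_{k+1}^{-1}}{a(q_{j+1} + q_j)^{-1}} \ll \sum_{a=1}^{\infty} e^{-\tau aq_j}q_jq_{j+1}q_{k+1}^{-1} \ll q_{j+1}q_{k+1}^{-1}.$$

Thus by the assumption $q_{k+1} > e^{\frac{\tau}{2}q_k}$,

$$\sum_{\substack{j < k:\ q_{j+1} > e^{\frac{\tau}{2}q_j} \\ q_j \ge b_1}} |H_{j,k}(x)| \ll \sum_{j=0}^{k-1} q_{j+1}q_{k+1}^{-1} \ll q_kq_{k+1}^{-1} \ll e^{-\frac{\tau}{4}q_k}, \tag{4.3}$$

where we used that $\sum_{j=0}^{k-1} q_{j+1} \ll q_k$, which is due to the basic fact that $q_{j+2} \ge q_{j+1} + q_j > 2q_j$ for all $j$.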
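(2) $j \ge k$. In this case, we use the trivial estimate that

$$\left|\frac{e(aq_jq_k\alpha) - 1}{e(aq_j\alpha) - 1}\right| \le q_k.$$

This implies

$$|H_{j,k}(x)| \ll \sum_{a=1}^{b_2a_j} e^{-\tau aq_j}q_k \ll q_ke^{-\tau q_j},$$

and thus

$$\sum_{\substack{j \ge k:\ q_{j+1} > e^{\frac{\tau}{2}q_j} \\ q_j \ge b_1}} |H_{j,k}(x)| \ll q_k\sum_{q=q_k}^{\infty} e^{-\tau q} \ll e^{-\frac{\tau}{4}q_k}. \tag{4.4}$$

The lemma is established by combining (4.2), (4.3) and (4.4). □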


5. Almost periodicity of orbit 

We know from the previous lemma that when $n$ moves along arithmetic progressions of step length $q_k$, the $y$-coordinate of $T^n(x, y)$ is close to a linear sequence in $\mathbb{T}^1$ of slope $\hat h(0)$, which is the average value of the function $h$. Our next goal is to control this average. Unlike the degree $d$, $\hat h(0)$ cannot be fully eliminated. However, it exhibits similar Diophantine patterns to those of $\alpha$.

Lemma 5.1. Suppose $h$ is as in Lemma 4.1, and (1.2) is not true. Then there exist $\eta \in (0, \frac{\tau}{4})$ and $S \in \mathbb{N}$ such that for any given $\delta > 0$: for all sufficiently large $k$ with $q_{k+1} > e^{\frac{\tau}{2}q_k}$ and any positive integer $a < e^{\eta q_k}$,

$$\|aSq_k\hat h(0)\| < \delta.$$


Proof. By Lemma 2.2, there exist $s \in \mathbb{N}$, distinct primes $p_1 < p_2$, and a measurable map $g : \mathbb{T}^1 \to \mathbb{T}^1$ such that (2.3) holds.

By Corollary 3.3, after replacing $h$ by $\sum_{m \in M_{p_2^2,1} \cup \{0\}} \hat h(m)e(mx)$, (2.3) holds for the same $p_1$, $p_2$ and a different $g$. In addition, $\hat h(0)$ does not change by this modification. So we can assume $\hat h$ is supported on $M_{p_2^2,1} \cup \{0\}$.

By Luzin’s theorem, there is a compact set H C of Lebesgue measure 
at least 0.9 on which g is continuous. There is e such that if ||a; — x'\\ < e 
and x,x' G Vt then \\g{x') — g{x)\\ < |. Without loss of generality, we will 
assume e < |. 

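For $\eta < \frac{\tau}{4}$, under the assumptions on $k$ and $a$, $a < e^{-\frac{\tau}{4}q_k}q_{k+1}$, and thus

$$\|aq_k\alpha\| \le a\|q_k\alpha\| < e^{-\frac{\tau}{4}q_k}q_{k+1} \cdot q_{k+1}^{-1} = e^{-\frac{\tau}{4}q_k},$$

which is less than $\epsilon$ when $k$ is sufficiently large. Consider the pair of points $x$ and $x + aq_k\alpha$: they both belong to $\Omega$ if $x$ lies in $\Omega' = \Omega \cap (\Omega - aq_k\alpha)$. As $m_{\mathbb{T}^1}(\Omega - aq_k\alpha) = m_{\mathbb{T}^1}(\Omega) \ge 0.9$, we know $\Omega'$ has positive measure and in particular is non-empty.

Write $\psi(x)$ for $g(x + \alpha) - g(x)$, which by (2.3) equals $s\big(H(p_1, x_0 + p_1x) - H(p_2, x_0 + p_2x)\big)$ for almost every $x \in \Omega'$.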
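Fix hereafter such a generic point $x \in \Omega'$; then

$$\|g(x + aq_k\alpha) - g(x)\| < \frac{\delta}{2}. \tag{5.1}$$

We apply (2.1) to get the Fourier expansion of $\psi$:

$$\psi(x) = s\sum_{m \in \mathbb{Z}} \hat h(m)\Big(\sum_{l=0}^{p_1-1} e(lm\alpha + mx_0 + p_1mx) - \sum_{l=0}^{p_2-1} e(lm\alpha + mx_0 + p_2mx)\Big).$$

In other words,

$$\hat\psi(m) = s\Big(\hat h\big(\tfrac{m}{p_1}\big)\sum_{l=0}^{p_1-1} e\big(l\tfrac{m}{p_1}\alpha + \tfrac{m}{p_1}x_0\big) - \hat h\big(\tfrac{m}{p_2}\big)\sum_{l=0}^{p_2-1} e\big(l\tfrac{m}{p_2}\alpha + \tfrac{m}{p_2}x_0\big)\Big),$$

with the convention that $\hat h(\theta) = 0$ if $\theta \notin \mathbb{Z}$.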

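This implies several properties of $\psi$. First, as $h$ satisfies (3.1),

$$|\hat\psi(m)| \le s\Big(p_1\big|\hat h\big(\tfrac{m}{p_1}\big)\big| + p_2\big|\hat h\big(\tfrac{m}{p_2}\big)\big|\Big) \ll e^{-\frac{\tau}{p_2}|m|}, \tag{5.2}$$

with an implied constant depending on $h$, $s$ and $p_2$.

Second,

$$\mathrm{supp}\,\hat\psi \subset p_1M_{p_2^2,1} \cup p_2M_{p_2^2,1} \subset M_{p_2^2,p_1} \cup M_{p_2^2,p_2} = M_{p_2^2,p_2}. \tag{5.3}$$

Furthermore,

$$\hat\psi(0) = s(p_1 - p_2)\hat h(0). \tag{5.4}$$

Construct $\Psi(n, x)$ as in (2.1) with $h$ replaced by $\psi$, i.e. $\Psi(n, x) = \sum_{l=0}^{n-1} \psi(x + l\alpha)$.

Thanks to (5.2) and (5.3), we can apply Lemma 4.1 to $\Psi$ to assert that, uniformly for all $x \in \mathbb{T}^1$,

$$|\Psi(q_k, x) - q_k\hat\psi(0)| \ll e^{-\frac{\tau}{4p_2}q_k}. \tag{5.5}$$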

On the other hand, notice that

$$g(x + aq_k\alpha) - g(x) = \Psi(aq_k, x) = \sum_{l=0}^{a-1} \Psi(q_k, x + lq_k\alpha). \tag{5.6}$$

From (5.1), (5.5) and (5.6), we deduce

$$\begin{aligned}
\|aq_k\hat\psi(0)\| &\le \Big\|\sum_{l=0}^{a-1} \Psi(q_k, x + lq_k\alpha) - aq_k\hat\psi(0)\Big\| + \Big\|\sum_{l=0}^{a-1} \Psi(q_k, x + lq_k\alpha)\Big\| \\
&\le \sum_{l=0}^{a-1} |\Psi(q_k, x + lq_k\alpha) - q_k\hat\psi(0)| + \|g(x + aq_k\alpha) - g(x)\| \\
&\le Cae^{-\frac{\tau}{4p_2}q_k} + \frac{\delta}{2},
\end{aligned}$$

where $C$ is a constant that depends only on $h$, $s$ and $p_2$.

Choose $\eta < \frac{\tau}{4p_2}$. Then $\|aq_k\hat\psi(0)\| \le C \cdot e^{\eta q_k} \cdot e^{-\frac{\tau}{4p_2}q_k} + \frac{\delta}{2}$ is less than $\delta$ for sufficiently large $k$.

However, by (5.4), $\|aq_k\hat\psi(0)\| = \|aq_ks(p_1 - p_2)\hat h(0)\|$. So the lemma is true with $S = s\,|p_1 - p_2|$. □


This, together with Lemma 4.1, shows that an orbit of $T$ is, within intervals of length up to $Sq_ke^{\eta q_k}$, almost periodic with period $Sq_k$.

Corollary 5.2. Let $\eta$ and $S$ be as in Lemma 5.1. Then given any $\delta > 0$, for all sufficiently large $k$ with $q_{k+1} > e^{\frac{\tau}{2}q_k}$ and any positive integer $a < e^{\eta q_k}$,

$$d\big(T^{aSq_k}(x, y), (x, y)\big) < \delta, \quad \forall (x, y) \in \mathbb{T}^2.$$

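Proof. $T^{aSq_k}(x, y) - (x, y) = \big(aSq_k\alpha,\ H(aSq_k, x)\big)$. So it suffices to bound both $\|aSq_k\alpha\|$ and $\|H(aSq_k, x)\|$ by $\frac{\delta}{2}$.

First, $\|aSq_k\alpha\| \le aS\|q_k\alpha\| < aSq_{k+1}^{-1} < e^{\frac{\tau}{4}q_k}Se^{-\frac{\tau}{2}q_k} = Se^{-\frac{\tau}{4}q_k}$. As $S$ is independent of $k$, this is less than $\frac{\delta}{2}$ when $q_k$ is large enough.

On the other hand, by Lemma 4.1,

$$\begin{aligned}
\|H(aSq_k, x)\| &= \Big\|\sum_{l=0}^{aS-1} H(q_k, x + lq_k\alpha)\Big\| = \Big\|\sum_{l=0}^{aS-1} \big(H(q_k, x + lq_k\alpha) - q_k\hat h(0)\big) + aSq_k\hat h(0)\Big\| \\
&\le \sum_{l=0}^{aS-1} |H(q_k, x + lq_k\alpha) - q_k\hat h(0)| + \|aSq_k\hat h(0)\| \\
&\le CaSe^{-\frac{\tau}{4}q_k} + \|aSq_k\hat h(0)\|,
\end{aligned}$$

where $C$ is a constant independent of $k$. By Lemma 5.1, $\|aSq_k\hat h(0)\| < \frac{\delta}{4}$ when $k$ is sufficiently large. As for the first term, as $a < e^{\eta q_k}$, $\eta < \frac{\tau}{4}$ and $C$, $S$ do not depend on $k$, $CaSe^{-\frac{\tau}{4}q_k} \le CSe^{(\eta - \frac{\tau}{4})q_k}$ becomes less than $\frac{\delta}{4}$ for large $k$. Therefore $\|H(aSq_k, x)\| < \frac{\delta}{2}$. This completes the proof. □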


6. Averages in short intervals 


We continue to assume that (3.1) is true but (1.2) is not, and try to obtain a contradiction.

Disjointness from $\mu(n)$ is known for periodic sequences. And in light of the previous section, we know $T^n(x_0, y_0)$ is not far from a periodic sequence. In this part we show that the error obtained there does not accumulate to destroy the disjointness.

To begin with, remark that we can always assume there are infinitely many $k$'s with $q_{k+1} > e^{\frac{\tau}{2}q_k}$. Otherwise, Corollary 3.3 allows us to assume $\hat h$ is supported at $\{0\}$. Then $T(x, y) = (x + \alpha, y + \hat h(0))$ is a rotation of $\mathbb{T}^2$, for which (1.2) is known to hold by Davenport's theorem [Dav37].

From now on, we fix a function $f(x, y) = e(\xi_1x + \xi_2y)$, which is allowed by Remark 2.1. It suffices to show for all $\delta > 0$ that, for sufficiently large $N$,

$$\Big|\frac{1}{N}\sum_{n \le N} f(T^n(x_0, y_0))\mu(n)\Big| \ll \delta, \tag{6.1}$$

with an implied constant that is allowed to depend on $h$ and $f$.

Let rj and S are given by Lemma |5.2| . Suppose k is sufficiently large and 
satisfies qk+i > And set A = Furthermore, let Nq < ^- he an 

integer. 

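The average (1.2) decomposes into averages in shorter intervals of length $ASq_k$:

$$\begin{aligned}
\frac{1}{N}\sum_{n \le N} \mu(n)f(T^n(x_0, y_0)) &= \frac{1}{N}\sum_{L=N_0}^{N-1} \mu(L)f(T^L(x_0, y_0)) + O\Big(\frac{N_0}{N}\Big) \\
&= \frac{1}{N}\sum_{L=N_0}^{N-1} \frac{1}{ASq_k}\sum_{n=L}^{L+ASq_k-1} \mu(n)f(T^n(x_0, y_0)) + O\Big(\frac{ASq_k}{N}\Big) + O\Big(\frac{N_0}{N}\Big) \\
&= \frac{1}{N}\sum_{L=N_0}^{N-1} \frac{1}{ASq_k}\sum_{l=L}^{L+Sq_k-1}\sum_{a=0}^{A-1} \mu(l + aSq_k)f(T^{l+aSq_k}(x_0, y_0)) + O\Big(\frac{ASq_k}{N} + \frac{N_0}{N}\Big).
\end{aligned}$$

As $0 \le a < A \le e^{\eta q_k}$, by Corollary 5.2, for sufficiently large $k$,

$$|f(T^{l+aSq_k}(x_0, y_0)) - f(T^l(x_0, y_0))| \le \|f\|_{C^1}\,\delta.$$

Therefore,

$$\frac{1}{N}\sum_{n \le N} \mu(n)f(T^n(x_0, y_0)) = \frac{1}{N}\sum_{L=N_0}^{N-1} \frac{1}{ASq_k}\sum_{l=L}^{L+Sq_k-1}\sum_{a=0}^{A-1} \mu(l + aSq_k)f(T^l(x_0, y_0)) + O(\delta) + O\Big(\frac{ASq_k}{N} + \frac{N_0}{N}\Big).$$

For each $L$, construct a function $F_L : \mathbb{N} \to \mathbb{C}$ by $F_L(n) = f(T^l(x_0, y_0))$ where $l$ is the unique integer in $[L, L + Sq_k)$ such that $l \equiv n \pmod{Sq_k}$. Then $F_L$ is periodic with period $Sq_k$ and $|F_L| = 1$, and

$$\frac{1}{N}\sum_{n \le N} \mu(n)f(T^n(x_0, y_0)) = \frac{1}{N}\sum_{L=N_0}^{N-1} \frac{1}{ASq_k}\sum_{n=L}^{L+ASq_k-1} \mu(n)F_L(n) + O\Big(\delta + \frac{ASq_k}{N} + \frac{N_0}{N}\Big). \tag{6.2}$$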

We next decompose the averages of $\mu(n)F_L(n)$ into short averages of Dirichlet characters.

Lemma 6.1. For all $L, Q, A \in \mathbb{N}$ and any periodic function $F : \mathbb{N} \to \mathbb{C}$ of period $Q$ with $|F| \le 1$,

E ^(n)F(n) 

L<n<L-\-AQ 

X mod* 


where the first average on the right hand side is taken over all pairs $(d, \chi)$ such that $d \mid Q$ and $\chi$ is a Dirichlet character of conductor $\frac{Q}{d}$.

Proof. We group $n$ according to $(n, Q)$:

f^{n)F{n) 

L/^Th<C.L-\-AQ 

An)F{n) = J2 E 


fi{rd)F{rd) 


d\Q L/‘^Ti<fL-\-AQ 
{n,Q)=d 


d\Q 


(u#)=i 


(6.3) 


E Ard)F{rd) = J2Ad) E Ar)F{rd). 


(r S)=(r,d)=l 


d\Q 


{r,^)=(r,d)=l 


Here the requirement that $(r, d) = 1$ is because otherwise $n = rd$ is not square-free and $\mu(rd) = 0$.

Identify the residues $r$ modulo $\frac{Q}{d}$ with $(r, \frac{Q}{d}) = 1$ with the finite abelian group $(\mathbb{Z}/(\frac{Q}{d})\mathbb{Z})^\times$. The Dirichlet characters of conductor $\frac{Q}{d}$ form an orthonormal basis of the $l^2$-space on this group with respect to the uniform probability measure. Hence $F(rd)\mathbf{1}_{(r,d)=1}$ can be decomposed as $\sum_{\chi \bmod^* \frac{Q}{d}} w_{F,\chi}\chi$ on this group. Then,

$$\sum_{\chi \bmod^* \frac{Q}{d}} |w_{F,\chi}|^2 = \|F(rd)\mathbf{1}_{(r,d)=1}\|_{l^2}^2 \le \|F(rd)\|_{l^\infty}^2 \le 1. \tag{6.4}$$

It follows from this and (^) that, 


E ^(n)F(n) 

L<n<L+AQ 


AQy 


E An)F(, 

Tj^n<C.LAAQ 


n] 


<- 


(AQy 


(AQy 


E E '^Fx E Ar)x{r) 

d\Q X mod* ^ J^<r<ji+AP 

E E ^ ^ ^ ,^Pir)x{r) 

d\Q X mod* ^ 


^<r<^+AS 


E E ^ Ar)x{r) 

'^IQ X mod* H d— d d 


We apply Cauchy-Schwarz inequality and get 
E fi{n)F{n) 

L^n<C.L-\-jA.Q 





E E 

^ ^ ^ I a(4 

d\Q X mod* S d<'’< 


E Kr)x{r) 


S E ^ 

^d\Q J ydlQ ^ mod* ^ ^ 


E 

^<r<^+A^ 


^l{r)x{r] 


<<5 


E 

d\Q 
X mod* S 


E Ki")xir) 


^<r<^+A^ 


where the last inequality is because ^ converges and there are 

dirichlet characters x of modulus Q. 


In order to control the short averages of $\mu(r)\chi(r)$, we make use of a recent theorem from [MRT15] that allows us to bound averages of a non-pretentious multiplicative function in random short intervals. For this purpose, we recall the definition of pretentiousness first.

For multiplicative functions $\nu, \nu' : \mathbb{N} \to \mathbb{C}$ whose absolute values are bounded by $1$, define

$$D(\nu, \nu', X) = \Big(\sum_{p \le X} \frac{1 - \mathrm{Re}\big(\nu(p)\overline{\nu'(p)}\big)}{p}\Big)^{\frac{1}{2}}.$$

The function below measures how closely $\nu$ pretends to be $n^{it}$:

$$M(\nu, X) = \inf_{|t| \le X} D(\nu, n^{it}, X)^2.$$


One direction of Halász's Theorem [Hal68] asserts that if $\sum_{n \le X} \nu(n) = o(X)$, then for all $t \in \mathbb{R}$,

$$\lim_{X \to \infty} D(\nu, n^{it}, X) = \infty.$$

It is well known that (see e.g. [IK04, (5.80)]) for a Dirichlet character $\chi$ of conductor $q$ and any $C > 0$,

$$\sum_{n \le X} \mu(n)\chi(n) \ll_C q^{\frac{1}{2}}X(\log X)^{-C}. \tag{6.5}$$

Hence we see that $\lim_{X \to \infty} D(\mu\chi, 1, X) = \infty$ for any given $\chi$.

It follows from [MRT15, Lemma C.1], which is based on an argument of Granville and Soundararajan, that

inf > j min (y^log logX, x, ^)) +0(1), 

\t\ 

or equivalently, 

1 


> - min (0oglog X, D{nx, 1, X)) +0(1). 


It follows that:

$$\lim_{X \to \infty} M(\mu\chi, X) = \infty. \tag{6.6}$$

The following proposition, due to Matomäki, Radziwiłł and Tao, is the key input in our method and will be applied to $\nu(n) = \mu(n)\chi(n)$.

Proposition 6.2. [MRT15, Theorem A.1] Let $\nu$ be a multiplicative function with $|\nu| \le 1$ and $X \ge l \ge 10$. Then

$$\frac{1}{X}\sum_{X \le L \le 2X} \Big|\frac{1}{l}\sum_{L \le n \le L+l} \nu(n)\Big|^2 \ll e^{-M(\nu,X)}M(\nu, X) + (\log X)^{-\frac{1}{50}} + \Big(\frac{\log\log l}{\log l}\Big)^2.$$

Proof of Theorem 1.1. Let $J = \lceil\log_2\delta^{-1}\rceil$. We choose $N_0 = \lfloor 2^{-J}N\rfloor$; then $\frac{N_0}{N} \le \delta$. Cut $[N_0, N)$ into $J$ dyadic intervals: $[2^{-j}N, 2^{-j+1}N)$ for $j = 1, \cdots, J$.

For each pair (d,y) and every j, by Proposition 6.2, 


E 

2-W<L<2-i+^N 


E h{r)x{r) 




</9 


2-i+iiV\ fioglog^^ 


(6.7) 


)^[ log 


ASqk 


where $\rho(X) = \rho_\chi(X) := e^{-M(\mu\chi,X)}M(\mu\chi, X) + (\log X)^{-\frac{1}{50}}$ is a positive function that converges to $0$ as $X \to \infty$, thanks to (6.6). However, we may make $\rho$ independent of $\chi$ (but dependent on $S$ and $q_k$), by taking $\rho(X) = \rho_{Sq_k}(X) := \max_{\chi \bmod Sq_k} \rho_\chi(X)$. Furthermore, by replacing $\rho_{Sq_k}(X)$ with $\sup_{X' \ge X} \rho_{Sq_k}(X')$ we may also assume $\rho_{Sq_k}$ is decreasing.

Recall that S and p are fixed constants determined by the dynamical 
system T. Notice d < Sqk, 2“-^+^ > 6, and A < Hence, 

f^X\ floglogAY f^X\ _2^ 2 

After combining the dyadic intervals, we have 

2 


N-l 

E 

L=No 


E 

d d 


p{r)xir) 


f^X\ _2, ^ 


( 6 . 8 ) 
It now follows that


E /x(n)/(r”(xo,yo)) 

n<N 

4 N -1 L+ASqu:-l / ASqk No 

E E Kn)FL{n) + 0(6+—p + ^ 
L=No n=L \ N N 


(Cauchy-Schwarz) / A^—1 

<C E 

\l=No 


L/-\-J\.SQf -—1 

E ^jL{n)FL{n) 

n=L 


1 

2 \ 2 


+ <5 + 


ASqk 

N 


(Lemma 6.1) 

<c 


/ 


7V-1 

Sqk E E 

L=No d\Sqk 

\ X mod* ^ 


L+ASqk—1 

E fJ-{n)x{n] 

n=L 


+ 5 + 


ASqk 

iV 


/ 


(S.8) / / 6N\ 

^ [Sqk-PSq^y-^J + Sq^^log-" qk^ +5 + 


^mk 


Sqk 


N 


Once 5 is fixed, for sufficiently large qk, 

Sg^Log^ qk < 5^. 

Fix such a qk that, in addition, verihes g^+i > then 


(6.9) 


E p{n)f{T^{xo,yo))<. (sqk-psqA^^ +<5" 

^<A'■ \ \bqk^ 


+ <5 + 


omk 


Sqk 


N 


Because 5, p, S and qk are now all fixed and psq^ is a function that 
decays to 0, for suffciently large N, Sqk ■ psq^ (^ ^ 

In consequence, (6.1) is verified for $N$ large enough. As $\delta$ is arbitrary, (1.2) holds, contradicting the hypothesis so far. This completes the proof of Theorem 1.1. □